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The Dirac equation without spinors 

Daniel Elton, Dmitri Vassiliev 


In the first part of the paper we give a tensor version of the Dirac equation. In the 
second part we formulate and analyse a simple model equation which for weak external 
fields appears to have properties similar to those of the 2-dimensional Dirac equation. 


1. Introduction 


The paper is an attempt at analysing the geometrical meaning of the Dirac 
equation 


7 *(iV — eA)^ = mil), 


( 1 . 1 ) 


and at presenting this geometry in a mathematical language understandable to 
specialists in partial differential equations. The paper consists of two parts. 

In the first part we give a tensor interpretation of bispinors (Section 3) and refor¬ 
mulate the Dirac equation in tensor form (Section 4). In Section 5 we discuss the 
simplifications which occur when we move from M 4 (3-dimensional Dirac equa¬ 
tion) to M 3 (2-dimensional Dirac equation). Note that the fact that the Dirac 
equation can, in principle, be written in tensor form is not new; see, for example, 


In the second part we adopt the point of view that the Dirac equation is not a 
precise equation, but a first approximation in an asymptotic process described by 
Feynman diagrams, and it should be used only when the external field A is weak, 
smooth, and slowly varying. We suggest the following model equation in M 3 : 


curl^ u = ±mu. 


( 1 . 2 ) 


Here u is a complex valued vector function, and curl^ is the usual curl corrected 
by t he electromagnetic field; see next section for precise definition. We show that 
(1.2) has properties very similar to those of the two-dimensional Dirac equation. In 
Section 6 we give a mathematical meaning to “weak, smooth, and slowly varying” 
by introducing asymptotic scaling in terms of a small dimensionless p aram eter a 
(fine structure constant) and perform a formal asymptotic analysis of (1.2). This 
formal asymptotic analysis reduces (0) to the Pauli equation. Mathematically 
rigorous (but, inevitably, more restrictive) results are presented in Sectio n 7 . 

Finally, in Section 8 we point out the remarkable similarity between GJ) and 
the Maxwell equations. 
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2. Principal notation 


Our notation mostly follows LL4 ], with minor modifications. 

By M” we denote Minkowski n-space with temporal coordinate x °, spatial 
coordinates (a; 1 ,... , x n ~ 1 ), and metric g = diag(+l, —1,... , —1). By M m 
we denote Euclidean m-space with spatial coordinates (x 1 ,... ,x m ), and met¬ 
ric diag(—1,... , — 1). We use Greek letters for tensor indices (irrespectively of 
whether we are in Minkowski or Euclidean space), and repeated indices imply 
summation. For vectors in M” or R m we denote (u,v) := u^v 11 . Bold type indi¬ 
cates the contravariant representation of a Euclidean vector: u = (it 1 ,... ,u m ) = 
(—iti, • • - , —Um). We write u*v := u^v 11 = —(u , v). 

We use the “overline” to denote complex conjugation, T to denote transposi¬ 
tion, and * to denote Hermitian conjugation (combination of complex conjuga¬ 
tion and transposition). Thus, the Dirac conjugate of a bispinor 0 in our notation 
is ijj* 7 °. The norm sign without any additional indices indicates the L 2 -norm of 
a function; say, for a vector function u : —> C m we have ||u|| 2 = J u-u d m x . 

By in and e = — \e\ we denote the mass and the charge of the electron, respec¬ 
tively. The fine structure constant is a = e 2 ~ By A = (A 0 , A) we denote 
the electromagnetic vector potential, which is a given real valued vector function. 
Here A 0 = <f> is the electric potential, and A is the magnetic vector potential. We 
denote := d/dx M and P := iS7 — eA. 

Our system of units is such that the speed of light c and Planck’s constant h are 
both 1, whereas m ~ 1 (equivalently, Compton’s wavelength ~ 1). 

By e AM " and we denote the totally antisymmetric pseudotensor in M 3 and 

M 4 , respectively; we take e 012 = e 0123 = +1. In M 3 we define the dual of an 
antisymmetric tensor T as T*\ := \e\ flv T^ v , and in M 4 as T*'■= \z><\ 

In M 3 we define the vector product as [n,u>] A := e XfJ,1/ v l j,w v -, accordingly, curl := 
[V, • ], and curU := [V + ieA, ■ ] = —i[P, ■ ] . We define divergence on vectors 
and tensors as divzt := and (divT) 1 ' := V^T^, respectively. 

“Pseudo” refers to quantities which behave in the “correct” way (as scalars, 
vectors, or tensors) under changes of coordinates preserving orientation, and which 
get the “wrong” sign under change of orientation. 

By a = (a 1 , u 2 , a 3 ) we denote the “vector” of Pauli matrices, 


a 1 = 


0 1 
1 0 


a 2 = 


0 -i 
i 0 


^3 = 


1 0 
0 -1 


and by 7 = ( 7 0 , 7 1 ,7 2 ,7 3 ) the “vector” of Dirac matrices. In Sections 3 and 4 
we use the spinor representation of bispinors as opposed to the more common 
standard representation (see Section 21 of |LL4 | for details); consequently the 
Dirac matrices appearing in these sections are 


7° = 


0 I 
I 0 


= 


0 -cr^ 


At = 1,2, 3. 
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3. A tensor interpretation of bispinors 

A bispinor in M 4 is a set of four complex numbers 



which change under passive Lorentz transformations in the following way. Under 
proper Lorentz transformations £' = B £ and r / = {B*)~ l ip where B is a complex 
2x2 matrix given by 

B = exp (— 5 </>n-er) = cosh^^) — n-er sinh(^</>) 

for a boost of speed V = taiih(0) along the unit vector n g R 3 , and by 

B = exp mer) = cos(|t?) + m-cr sin(|t?) 

for a rotation by angle $ around n. Here “passive” means that we transform the 
coordinate system and not the the bispinors themselves, and the prime refers to 
the representation of the bispinor in the new coordinate system. Under space 
inversion = iij and rf = it;. Under time inversion £' = er 2 £, rf = a 2 rj. 

Remark 3.1. A full rotation of the coordinate system (i? = 27r) changes the sign 
of a bispinor, which means that bispinors are defined up to a choice of sign. 

Throughout this section we assume that we are dealing with a bispinor satisfying 
the technical condition r]*t ; ^ 0. 

Let t be a fixed real time-like vector and let {e^°\ e^ 2 ), e®}, (e( fe ),e(0) = g kl ; 

be some coordinate basis. Set r := sign(t, e^). In other words, to every coordinate 
system we assign a number r = +1 or r = -1. We assume that for the original 
coordinate system (and, consequently, for all coordinate systems obtained from 
the original one by proper Lorentz transformations and space inversion) r = +1. 
Thus, the vector t fixes the positive direction of time. 

Put 

P ■= I V£|, 0 ■= arg(2»?*£), (3-2) 

j M := ip* 7°7' 1 ^, u M := —iip T (3.3) 

zi 7 /(i) ;= 

p 1 p 

Theorem 3.2. The quantities 

{ P , e - /<*>, ae = 0,1,2} 



(3.5) 
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defined in accordance with 0-0 have the following properties: 
p £ M+ is a scalar; 

0 £ S 1 is a pseudoscalar; 

the f are real vectors form ing an orthonormal triad, i.e., (f^ k \f^) = g kl ■ 
Conver sely , given a set with the above three properties there is a unique 

bispinor satisfying fj3.fl) -(3.4). 

Of course, “uniqueness” of a bispinor is understood as “uniqueness up to the 
choice of sign”; see Remark [T~Tj. 


Proof of Theorem 3.2. Clearly /W and f^ are real whilst (y°y^)* = 7 ^* 7 ° = 
7 O 7 M f r onr which it follows that j and hence are also real. 

Given a Lorentz transformation A of the coordinate system let A^„, p, v = 
0 , 1 , 2 ,3, be the real 4x4 matrix in terms of which the vector a transforms to the 
vector a' with components given by 


= A 


For a boost of speed V = tanh(^) along the unit vector n £ M 3 

A = exp ( — = I + (cosh [cf) — l) M 2 — sinh(0)M„, 

where 


/O 


M n = 


n 

n 1 0 

n 2 0 
3 0 


\n 


n 

0 

0 

0 


\ 


tv~ 

0 

0 

0 / 


whilst, for a rotation by angle d around n, 

A = exp ( — dN a ) = I + (l — cos(d))N 2 — sm(d)N n , 


where 

/0 0 0 0 \ 

_ 0 0 — n 3 n 2 

™n — n 3 1 

0 n° 0 — rr 

\0 — n 2 n 1 0 J 

If the Lorentz transformation A is proper the corresponding transformation of 
the bispinor if is linear and so can be represented by a complex 4x4 matrix 5(A); 
that is if' = S(A)if. Clearly 


5(A) = 




where B( A) is the complex 2x2 matrix defined above. A straightforward check 
using the explicit forms of the matrices 5(A) and B( A) for boosts and spatial 
rotations gives us 


h» vl v = 5 _1 (A) 7 m 5(A) 


(3.6) 
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in these cases (see Section 7.2 of |^| for more details). Formula ( |3.6| ) immediately 
extends to all proper Lorentz transformations since any such transformation can 
be written as a product of boosts and rotations. On the other hand det (5(A)) = 1 

from which we get a 2 B~ 1 {A) = B T (A)a 2 and a 2 B*( A) = B( A) a 2 . Hence 


5*(A)7°=7°5- 1 (A), 

(3.7) 

5(A) V = 7 2 ^(A). 

(3.8) 


Now, for proper Lorentz transformations, 


v'*e = i(B*( A)) \)*B{A)S = r 1 *B- l {A)B{A)i = 

so p' = p and 9' = 9. Also, by ([b(]) and (|3.7D, 

j' 1 * = ip'*'y°j ll ip' = ip*S*(A)'y°p lx S(A)ip 

= ^S-^A^SiA)^ = ^* 7 °A' i 1/7 > = A 


whilst a similar argument using (3.8) as well gives 


u= —iip T S T (A)p°'y 2 'y fx S(A)ip 

= -iip T j°~S(A) ~ 1 ')‘ 1 ^S{A)ip = —iip T j 0 "/ 2 S~ 1 (A)"/ ll S(A)ip = A * v u v . 


Under space inversion £' = ip and t/ = i£ so r/*£' = £*p = p*£ and ip' = S(A)ip 

where 5(A) = ^ . Thus p' = p and O' = —9. On the other hand A l ' v = 

diag(+l, —1, —1, —1) which can be used to directly check that ( |3.C| ) to ( [3li| ) are 
still valid; it follows that = A^ v j v and u" 1 = A M „it". 

Under time inversion £' = er 2 £ and rf = a 2 p so p'*t;' = p *(er 2 ) <r 2 £ = p*£, 
giving p' = p and 9' = —9 once again. Now A M „ = diag(— 1, +1, +1, +1). We 
cannot extend the definition of 5(A) to cover the present case; however if we set 

* = (° a° 2 ) then 

ip' = S t 4>, A%r = -5 t -VS 4 , 5 t * 7 ° = 7°5 t -1 and S~ t ~\ 2 = 


Therefore 


i" 1 = Vs^rsup = -ip T ^A\r ip = -A\r = -A^f, 

(n.b. 7 ° and j are real) and 

u r>1 = —lip* Sj 7°7 2 7 /i S t ip 

= —iip*'f > St ^j^Stip = iip*'y°p 2 A' l v p v ip = A 
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However t' = -t so fW"* = for k = 0,1, 2. 

Since any Lorentz transformation can be written as a combination of a proper 
transformation, space inversion and time inversion, the above calculations show 
that p is a scalar, 6 is a pseudoscalar and is a vector for k = 0 , 1 , 2 . 

The fact that {f( k \ k = 0,1, 2} is an orthonormal triad can be checked directly 
by a somewhat lengthy calculation. However, since we have established that the 
/W’s are vectors, it suffices to prove they are orthonormal for one particular choice 
of the coordinate system. 

Claim: there exists a unique proper coordinate system in which 


iP 




( e if> / 2 \ 

0 

e -ie/2 

V o / 


(3.9) 


for some p £ R + and 9 £ S 1 . Of course, these p and 9 must be given by 
Indeed, we have 

3 

fju = (£*£ + V*V ) 2 - - r)*cr v ri) 2 


= 4(C 1 ??i + i 2 m) (C 1 Vi + fvi) = | V£| 2 = p 2 > o. 


Therefore the real vector j is time-likc so we can choose a proper coordinate system, 
given by an appropriate boost, in which j = 0 and j° = p ; n.b. j° = £*£ + 77*77 > 0 . 
Now suppose we rotate the coordinate system by an angle 9 £ R about the unit 
vector n = (n 1 , n 2 , 0) £ M 3 . From above we have £' = H£ where 

cos(4$) (n 2 + in 1 ) sin(4$) 

(—n 2 + in 1 ) sin(4^) cos(^$) 




Choosing n 1 , n 2 and d so that (—n 2 + m 1 )sin(4 ^)^ 1 + cos(4 $)£ 2 = 0 we thus 
have £ /2 = 0. Since we are only rotating the coordinate system we must have 
j 7 = 0 and j'° = p. An elementary analysis of these formulae gives us rf ^ = 0 and 

W. 11 = 1^1 = 4 

Suppose now that we are in the rotated coordinate system (and drop the primes). 
Rotate this coordinate system by an angle about n = (0,0,1). From above 

we have £' = Bt; and rf = (H*) -1 ?; where 


B = 


a 0 
0 a 


a = e^/ 2 . 


Therefore l;' 2 = = 0, £ ;1 = at; 1 and = a?yp Now, |? 7 ^ | = |^| so we can 

choose d such that £'* = However r]'*f = r]*l ; (since we have only changed 
the coordinate system by proper Lorentz transformations) so 2 p'*t;' = 2(£ 71 )” = 





The Dirac equation without spinors 


7 


2( 77 'j ) 2 = pe l9 , giving us the required representation (|3.9|) . In this coordinate 
system it is straightforward to check that 


/(°) = 

f 1 ) 

0 

0 

, / (1) = 

1 

0 

, / (2) = 

(°) 

0 

1 


w 


w 


w 


(contravariant representation). Clearly, (3.10) can be true only in one proper 
coordinate system. This completes the proof of the claim. 

Formula ( |3.10| ) shows that {f( k \ k = 0,1, 2} is an orthonormal triad. The proof 
of the direct statement of Theorem P is complete. 

Conversely, suppose we have a set (T5) with the required properties. Choos e th e 
proper coordinate system in which ( 3.10 ) holds and define the bispinor ?/’ by (3.9). 
It follows that this ip satisfies (3.2)—(3.4). Now, let ip be another bispinor satisfying 
|). The above claim associates with ip a special proper coordinate system 


([LHME 

in which ip is given by the right-hand side of ( [l.9| ). But ( 3.10 ) can be true only in 
one proper coordinate system, so the special coordinate system for the bispinor ip 
is the one we are already working in. This implies ip = ip. □ 


Set f^ V := e >!rA M !y /( 0 )^j( 1 ) A /( 2 )^ . We shall call the pseudovector spin. 
Clearly, {f^ k \ k = 0,1, 2,3} is an orthonormal tetrad (orthonormal frame). 


4. The Dirac equation in tensor form 


From now on our bispinor ip is a function of the point x in space-time. We 
assume that ip is defined on some open set O C M 4 , and that ip is smooth and 
satisfies t he t echnical condition 77 * £ 7 ^ 0. Our obje ctive is to rewrite the Dirac 
equation (0 in terms of the tensor quantities ( |3.5| ). 

It will be convenient for us to deal with Euler-Lagrange functionals rather than 
with the corresponding differential equations. It is well known that the Euler- 
Lagrange functional for (0) is 

J — TeA)^ — m)ip d 4 x (4.1) 

(we included the factor r bec ause the Dirac equation is CT-invariant, but not 
T-invariant). Integration in ( fl.l|) is carried out over O. We do not assume, 
however, that the integral converges and treat ( |4.l|) as a formal expression. This 
is acceptable for our purposes because we are interested not in (0) itself, but 
only in its variation generated by a variation of the bispinor Sip € Cq°(0). The 
same applies to all subsequent functionals. 

























Elton, Vassiliev 


Further on we use 


Re 


*7°(7* r (*V — reA)^ — m)^)d 4 x 


(4.2) 


instead of (4.1) a s th e Euler-Lagrange functional for 0)- This is possible because 
the variation of (4.1) is real. Incidentally, without the Re the integrand is not a 
scalar (time inversion leads to complex conjugation). 

Let us now examine the rotation of the orthonormal frame {f ik \ k = 0,1, 2, 3} 
as we move from one point x to another. Let 

Sf {k) „ = (V*f {k \)Sx v (4.3) 

be the increment of the vector function /^ when we move from a: to a close point 
x + Sx. We define the (antisymmetric) tensor of infinitesimal rotations R as the 
solution of the linear system 


Sf (k \ = R„xf (k) , k = 0,1,2,3. 
The explicit formula for the solution of ( |4.4| ) is 

A- 


3,1 =0 


In particular, formulae (4.5), (|4.3|) imply 


(4.4) 


(4.5) 


(div(i?*)r = , E fti^ A (VJ (i V)/ (l) ; 


3 , 1 =0 


As R* is a pseudotensor, di v(R*) is a pseudovector. 


(4.6) 


Theorem 4.1. We have the identity 


Reyp* , y 0 {j >c {iS7 — TeA) it — 

T 


(/ (3) , div(R*) +grad0) + e(/ (0) ,A) + mcosO 


p. (4.7) 


Proof of Theorem |4.l| . Since both sides of (4.7) are scalars it suffices to check 
the identity at each point x £ O in only one coordinate system. Choosing the 
coordinate system given by the claim in the proof of Theorem 3.2 (based at the 
point x) we have 

f {k \=9k^, k,p = 0,1,2,3; 


n.b. this, and other expressions to follow, are not tensor identities but rather 
identities which hold at the point x for our special choice of coordinates. Using 
this expression and ( |4.6| ) we have 

(div(i?*)) 3 = Vo/ (1) 2 - Vr/W 2 + V 2 / (0) r, 
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and hence 

p(/ (3) , div(iT) + grad0) = p(V 0 / (1) 2 - Vi/ (0) 2 + V 2 / (0) ! + V 3 0) 

= V 0 (p/ (1) 2 ) - Vi(p/< 0 > 2 ) +V 2 (p/(°> 1 ) -*e- ie V 3 (pe ie ) + tV 3 p. 

Combining the general definitions ( |3.2| ) ( [3.j| ) with the specific value of the bispinor 
■ 0 (a:) in our chosen coordinate frame (formula ©) we get 

V 0 (p/ (1) 2 ) = ^-FVo^ + ^VoF-WVom+mVo^i), 
Vi(p/ (0) 2 ) = *(? T V 1 e 2 -e 1 ViF-^iV 1% +7,^X7^), 

V 2 (p/ (0) i) = -FV^-^V^ + WV^ + TyiV^, 

e-* e V 3 (pe ie ) = 2FV 3 e 1 +27 ?i V 3 77i, 

V 3P = FV 3 F + tfiV 3 ??i +7?iV 3 7Ti. 


The above expressions can then be combined producing 
p(/ (3) , div(i?*) + grad#) 


= -2 Re 


{e (iVo ^ 1 + iVrF + V 2 f 2 + zV^ 1 ) 

+ ?7i(zVo7?i - 7 Vi??2 - v 2?? 2 - *V 3 77i)} 


/ 0 
0 / 


7 V n 


E 


O v 0 
0 -<J l 


= -2Re|(r 7?*) 

= -2 Re (t/>* 7 ViV^) • 

From the definition of /we have 

pe(/ (0) , A) = p/ ( 0 )A 1 eA M = V>* 7°7 ^reA^if 




whilst the definition of p and 0 gives 


pmcosO = 777 .Re(277*£) = m (£* 77 *) 


0 A F 


= mip*"f 0 ip. 


, / 0 / y?7y 

The result now follows from the fact that both of these expressions are real. 
Formulae (|4.2|) and (4.7) imply 


□ 


Corollary 4.2. TTze Euler-Lagrange functional for the Dirac equation can be 
written as 


j l -{f^\ div(i?*) + grade) + e(fW,A) 


I.COS 0 


pd 4 x. 


(4.8) 
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Variation of (4.8) with respect to the scalar p, pseudoscalar 6 and the moving 
frame {f^ k \ k = 0,1,2, 3} produces a (nonlinear) system of tensor differential 


equations equivalent to the Dirac equation (1.1). Of course, in performing this 
variation one has to remember the constraints: all the quantities are real, p is 
positive, and = g kl ■ 


5. Simplifications in the case of M 3 


Let us return to the Dirac equ ation 0 ) and consider the case when A and ip 
do not depend on a; 3 . Then (0) separates into two systems of two equations : 


Pn 


Pm 


0 J T 

~P± -Po 


<P± 

X± 


<P± 

x± 


(5.1) 


where P± := Pi ± 1 P 2 and 


¥>+ 

V- 


Z + V 

V 2 


X- 

x+ 


Z-v 

V 2 


(The relation with stan dard n otation is <p+ = <Pi, <P- = <P 2 , X- = Xi) X+ = X' 2 ; 
see formula (21.17) in |LL4|.) Accordingly, simplifications occur in the tensor 
functional ( |4.8| ). We give the final result omitting intermediate calculations. 

We are now working in M 3 and the particle is described by the set of quantities 


{p; /<*>, fc = 1, 2} 


(5.2) 


where p £ R + is a scalar and the are real vectors forming an orthonormal 
dyad, i.e., (/(^/(O) = g kl _ As we are in a 3-space it is convenient to use the 
notion of a vector product. Put := and define the pseudovector 

of infinitesimal rotations r as the solution of the linear system Sf^ = [r, 
k = 0,1,2. Then r = — R* where 


(cf. (4.5)), and 


= E A 


3 , 1=0 


divr = E 3 ii(/ 0 \curl/ (i) ) 
1 3,1=0 


(5.3) 


(5.4) 


(cf. (L6)). The functional (L8) turns into 
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where the sign corresponds to that in ©■ The functional(s) (^) should be 
varied with respect to p and the moving frame k = 0,1,2}. 


Remark 5.1. The /(°) from this section is a pseudovector in M 3 , and it coincides 
up to sign with the corresponding part of the 4-vector /i 0 ' from Sections 3 and 4. 


Remark 5.2. The set (522) is equivalent to a complex valued vector function u 
satisfying the constraint (u, u ) = 0; the equivalence is established by the formula 
u = p(f ^ + */^), cf. (3.4). This is not surprising: Cartan originally defined 
spinors as complex vectors u satisfying (u,u) = 0, see Section 52 in jc]]. 


6. A model equation in M 3 


The arguments in this section are not mathematically rigorous, and are needed 
to motivate the introduction of the equation (0. 

Let a —> +0 be an asymptotic parameter. Assume that the external electromag¬ 
netic field A is smooth and satisfies 

eA ~ a 2 , d^A^a^A, (6.1) 

where (3 = (/3o, • ■ ■ ,Pz) is an arbitrary multiindex, |/3| = /3q + ... + /? 3 , A = 
d^A^ = (V 0 )^° ■ ■ • (V 3 )P 3 A 11 , and stands for “asymptotically of the order of”. 
In other words, we assume that the field is weak (potential energy of the electron 
~ a 2 ) and slowly varying (each differentiation gives an additional a). 

Our scaling assumptions (0 are meant to model the situation which occurs in 
the hydrogen or positronium atoms. Indeed for the hydrogen atom 

eA° = e4> = — or -1 , A = 0, (6.2) 

where r = ^/(x 1 ) 2 + (a: 2 ) 2 + ( x 3 ) 2 . But the characteristic length associated with 
the wave functions of bound states is 


(6.3) 


The latter is established by elementary analysis of the corresponding Schrodinger 
equation; say, the wave function of the ground state is 4/ = e ~ arnr . Formulae 

(0), (0 imply (PI). 

Of course, such arguments should be treated with a fair degree of caution as 
the Coulomb potential has a singularity at the origin. However, in theoretical 
physics it is common to disregard this technical difficulty, and it is known (see, 
e.g., Sections 33 and 34 in [ LL4 1) that one can get very sharp results on the basis 
of formal asymptotic calculations of the type & (0. 
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The question we address now is whether it is possible to suggest simple tensor 
equations which would be asymptotically equivalent (up to a certain accuracy in 
powers of the small parameter a) to the Dirac equation. In our search we accept 
equations whose algebraic structure may be totally different from that of the Dirac 
equation, as long as th ey hav e (asymptoti cally ) the req uired spectral properties. 

Examination of (|5.5| ) , ( [5.4|) an d Remark |A2] suggests (U) as the natural candi¬ 
date in M 3 . Let us rewrite ( |l.2| ) as 

— i[P, ?/] = ±mu, (6-4) 


and formally analyse the properties of this equation. Rigorous mathematical ana¬ 
lysis is deferred till the next section. 

Let us first make some general observations. 

Observation 1: the equation & is not algebraically equivalent to the Dirac 
equation (5.1). This is clear from the fact that the number of equations in ( |6.4|) ( 3 
equations) and (5T) (2 equations) is different. Also, there are no spinors inffM). 

Observation 2: the equation ( 6 . 4 ) is Lorentz invariant. In fact, ( |6.4| ) is prob¬ 
ably “more invariant” than the Dirac equation because it can be used in curved 
space-time: the notions of vector product and curl are defined on any pseudo- 
Riemannian 3-nranifold. 


Observation 3: the equation ( 6 . 4 ) is formally self-adjoint: 


Moreover, it has an Euler-Lagrange functional which can be written as 

/ {u,-HPAPmu)^ o, /pe< n ,-i[P,„] Trau >^. 


Equation (6.4) can be presented in matrix form as 


0 P 2 -Pi 
P 2 0 P 0 
v-Pi -Po 0 



= ±im 


The first row gives u° = =pi(P 2 ii 1 — Plu 2 )/to which can then be used to eliminate 
u° from the remaining two rows. This results in a 2 x 2 second order system of 


equations which is equivalent to equation (6.4): 


m 2 + P 2 2 
=FimP 0 - PiP 2 


±itoPq — P 2 P 1 
to 2 + Pi 2 


= 0 . 


(6.5) 


Now, suppose we are looking for bound state solutions; that is, assume that 


A does not depend on x° and u is o f the form u(x 1 ,x 2 )e 
functions we have Pq = s — e<h , and (3.5) reduces to 


For such vector 


Mu = eBu 


(6.6) 
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where 


A := 


( TO 2 + P 2 2 
l ±ime<& — P 1 P 2 


=F *me$ — P2P1 


m 


a ( 0 =F im\ 

' Viiw 0 ) ’ 


u = 


C 2 , 


and e is the spectral parameter. Note that A is not elliptic. 

Observation 4 : equation \ 6 .f\ ) asymptotically reduces to the Pauli equation. In¬ 
deed, the unitary transformation 

,1N 


1 

V 2 


1 1 

i —i 


V?+ 

V?- 


turns (643) into 

I m 2 + P ~f + ± me$ =F sen 


p±_ 

2 


m 


P+P- 


_p_4 

2 


=F rne<3? ± em 



= 0. 


( 6 . 7 ) 


The latter system formally reduces to the scalar equation 

-1 


PfP± 

2 m 


e$ - 


4?n 2 T 


2 / _ , _ , P ± Pp 


m + e + 


2 m 


— e$ 


P± 2 ]<P± = (s - 


Suppose we are looking for the bound states of the electron, so that e ~ +m, and 
suppose that our eigenfunction inherits the slow variation property of the potential, 

■ v ± 


Pi^ 1 P 2 ^ 2 v?± ~ a^ 1 + ^ 3 a <p±. Then the above scalar equation can be rewritten as 


PpP± 

2 m 


+ e& + 0 (a 4 ) I tp± = (e — m)(pi 


( 6 . 8 ) 


But P T P± = P 2 =F eH 3 , where P 2 := Pi 2 + P 2 2 , and H 3 = ie- 1 {P 2 Pi - P 1 P 2 ) = 
V 2 A 1 — Vi^ 4 2 is the intensity of the magnetic field, see formula (23.5) and list of 


notation in |LL2|. Therefore, ( pTq ) takes the form 

'P 2 =Fe# 3 


2 m 


+ e<!> + 0(a) I tp± = {e — m)<pi 


( 6 . 9 ) 


which is the Pauli equation perturbed by the O(o 4 ) term; note that our scaling 
assumptions dj imply eH 3 ~ a 3 , so the magnetic term cannot be included in 


0(a 4 ). Similar arguments (see also Section 33 in |LL4|, as well as Theorem 6.8 in 


@) reduce ([□]) to ( |6.9|). T hus, we have (formally) shown that the energy levels 
of our model equation (3.4) and those of the 2-dimensional Dirac equation (H 
are related as 

^model = ^Dirac T 0{ot ) . ( 6 . 10 ) 

Normally one subtracts the rest mass from e and deals with E := e — m ~ a 2 , so 


(6.10) means that the relative accuracy in the determination of E is ~ a 2 
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7. Spectral properties of the model equation 


Throughout this section we assume that the electromagnetic vector potential A 
does not depend on or, is smooth and vanishes at infinity. In addition we assume 

||e$||i~<m, (7.1) 

and that the first derivatives of A vanish at infinity. 

Define the operator V formally by 

p =(Jfp 2 ~pf) = (h 2 ) «)=<-ft «)' <-ft ft) - 

More precisely, we consider first the nonnegative symmetric operator Co°(R 2 ) —> 
L 2 (R 2 ) given by the above expression, and define V as the Friedrichs extension of 
the latter; see Theorem 4.4.5 from Thus, V : D(V ) —> L 2 (R 2 ) is a nonnegative 
self-adjoint operator defined on some D(V) D S(R 2 ) D Cq°( R 2 ) (here S stands 
for the Schwartz class). 

We have A = V + m 2 I + and the operator A : D(A) —* L 2 ( R 2 ), D(A) = 
Dip), is self adjoint. Furthermore, condition 0) means that ||e<b£>|| < m 2 , so 
the operator A is positive definite: a := inf cr{A) > 0. Therefore A~ * is a bounded 
self-adjoint operator in L 2 (R 2 ). We can now rewrite ( l6.6| ) as 

A~^BA~^v=-v. (7.2) 

£ 

1 1 

The latter spectral problem is well posed because A~^BA~^ is a bounded self- 
adjoint operator in L 2 (R 2 ). 


Theorem 7.1. The essential spectrum of the operator A 2 BA 2 is the interval 


[—m 1 ,m 
Theorem 


n|fe| 


says, in effect, that for potentials which are sufficiently weak, 
smooth and well behaved at infinity, the basic structure of the spectrum of our 
model equation (|6.4|) is the same as that of the Dirac equation (5.1). 


Proof of Theorem [tt| . 

Part 1 of the proof. Let us prove 


[-m 1 ,m 1 ] C cress (A 2 BA =). 


(7.3) 


Let e be an arbitrary real number such that |e| > m. Suppose we have a sequence 
{ u ( n )} such that 

u ( ") e C“(R 2 ), 

ll u(n) || = !> 


(7.4) 

(7.5) 
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| [A — eB)vl 


(™) I 


0 


(7.6) 


(a sequence of approximate eigenfunctions of problem (6.6)). Put v( ra ) = A 2 £ 

D(A?) C L 2 (R 2 ). We have ||v( n )|| > ah and 


||(e- 1 -.A-*B./H) V (n >|| = < lerl- 1 ^^!!(>t —eH)u<” ) ||, 

so ||(e _1 — A~^BA~^)'v^\\ —> 0. This implies e _1 £ cr(A~?BA~i). As e _1 is an 
arbitrary number in (—m _1 , m _1 ) \ {0} we arrive at ©■ Thus, we have reduced 
the proof of © to the construction of a sequence with properties ( |7.4| ) (|7.6|). 

Let us denote by Vq the operator V in the case A = 0, and by A the operator 
A in the case A = 0; of course, Aq = Vq + m 2 I. 

Suppose we have a sequence {A")} with properties (0, (0) and 

|| (A - £B)u^\\ —> 0. (7.7) 


Let us modify this sequence by translating the functions, that is, by replacing 
each u(")(a;) by iA)(x — a;( n )), where x^ £ JR 2 . Clearly, if the sequence of points 
\x^} is chosen to tend to infinity sufficiently quickly, then ||(A — A)u (n )|| —► 0 
and (777) will imply (0. The proof of (|7.3|) ha s been reduced to the construction 



of a sequence with properties (0)- 0M 
As functions of Schwartz class can be approximated by C( 
relax the condition (0) by replacing it with 


functions, 


we can 


u 


(n) 


! ). 


(7.8) 


The differential operator A — eB has constant coefficients, so it is natural to 
switch from the W"^*) to their Fourier transforms 


fi (n) (0 


1 

2tt 


e -i{x,f) u (n) 


(a;) d 2 x , 


{x,0 = x 1 ^! + x 2 &- 


We now have to construct a sequence -{A 71 )} such that 

> 2 ), 


U (n) g 


(7.9) 


u 


= 1, 


| (A — eB)u 


Ml 


0, 


(7.10) 

(7.11) 


where 


A 0 -eB = AC£) -eB = 


( e 2 +m 2 

\-£i& T ism. 


-&£i ± iem\ 
+ m 2 ) 


is the (full) symbol of A - sB. We have det(A(£)~ e ®) = m 2 (^ 2 +^|+m 2 —e 2 ) and 
|er| > m, so we can choose an rj to give det(A(??) — sB) = 0 . Let u be a normalised 
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(constant) vector in the null space of Ao(rj) — eB, and let be a sequence 

of scalar functions such that £ Cq°(R 2 ), ||^")|| = 1, and supp</>6) —y {ry}. 
It is easy to see that the vector functions u*”)(£) := have the required 

properties 

Part 2 of the proof. Let us prove a ess (A~^ BA~^) C [—m , m -1 ]. 

Since ||S|| = m it is sufficient to show t7 e ss(A) C [m 2 ,+oo). However, A = 
V + m 2 I + e$B, so, in turn, it is sufficient to show 


a ess (V + TH) C [0, +00), 


(7.12) 


where $ = e<fi is an arbitrary smooth real valued function vanishing at infinity. 
Claim: for any \E r £ Cq°(R 2 ) we have 

(7.13) 


Cess (fPo + I + H'H) C [ 0 , +00) . 


Indeed, let us define the operators Q 0 := Vq + I, 77 := Q 0 2 '$’BQ 0 


The 


operator Q 0 is a positive definite differential operator with constant coefficients, 
and its symbol is 

'£ + 1 -C2C1' 

,-66 6 2 + 6 


Qo(C) = 


Therefore Q 0 2 is a pseudodifferential operator with symbol 


< 2 o*(£) = 


1 0 
0 1 


1 


6 2 


- 66 \ 

-(l+s) V-66 Ci ) 


= (i + 6 2 + 6 2 )6 


The principal symbol of the operator Q 0 2 is 

1 


M(6 = 


1 0 
0 1 


C 2 + C 2 V-66 


-6C1 

C1 2 


so Q 0 2 is a pseudodifferential operator of order 0. It follows that 77 is a pseudo¬ 
differential operator of order 0. The principal symbol of 77 is 


± imM(6 


0 -ff-(x) 

ff'(x) 0 


M(C) = 0, 


so 77 is in fact a pseudodifferential operator of order -1. Now, the Schwartz kernel 
K-jz(x, y) of the operator 77 is given by the oscillatory integral 


± 


(2?r) 2 


QoMC)e 


i(x—z,£) 


0 -ff'(z) 

T(z) 0 


Q 0 2 (77) e^ z - y ^d 2 id 2 zd 2 r] . 


Let SI C K 2 be a bounded open set with supp’I' C O. A standard calculation 
shows that K-ji(x,y) is smooth and rapidly decreasing in both x and y outside 
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f Ixfl (n.b. this calculation relies on the fact that the matrix function Q 0 2 (£) is 
smooth on the whole of 1R 2 , including the point £ = 0). Since K-ji(x, y) is the 
integral kernel of a pseudodifferential operator of order -1 inside fl, it follows that 
the operator 1Z is compact. Hence the operator I+ 1Z is nonnegative on a subspace 
of finite co-dimension. Consequently, the quadratic form 

J v*(I + n)vd 2 x, veL 2 (K 2 ), 

is nonnegative on a subspace of finite co-dimension. Equivalently, 

j (Qq u)*(I+ K)Q£ud 2 x, u eD(V£), 

is nonnegative on a subspace of finite co-dimension. But th e latt er is the quadratic 
form associated with the operator Vo + I + EH. Formula ( 7.13 ) now f ollows. 

Let us now remove the restriction E £ Qj^R 2 ) and show that ( 7.13|) still holds. 
Given an arbitrary e > 0 we can decompose f as $ = where Eo £ C£° 

and || 'Ll ||l°° < m~ 1 e. We have 

V 0 + / + EH = {Vo + I + E 0 H) + #i H. 

According to ( 7. 13| ) <J ess (Vo + I + E 0 H) C [0, + 00 ), whereas E X H is a bounded 
operator with ||'E 1 £>|| < e . Th is implies a ess (Vo + I + EH) C [—e, + 00 ). As e > 0 
is arbitrary we arrive at ( |7.13| ) for general E. 

Now, given an arbitrary S £ (0,1) we have 

Vo + EH > SV 0 + EH = 6 (Vo + 1 + <5 _1 EH) - SI. 


According to (|7.13|) a ess (Vo + I+S X EH) C [0,+ 00 ), so cr eS s(Po +EH) C [-5,+ 00 ). 
As <5 £ (0,1) is arbitrary we conclude that 


& ess (Vo + EH) C [0, + 00 ). 


(7.14) 


Thus, we have proved ( 7. 12j ) in the case A = 0. Let us now remove this restric¬ 
tion. For any v > 0 and any smooth vector function u we have pointwise 


\P 1 u 2 -P 2 u 1 \ 2 = 


Vi u z — V 2 u ^| 2 ^ ve 2 \Aiu 2 — A 2 uT + \Piu z — P 2 u 


11 2 


l + i/ 


,H 2 


where P* := *(1 + v) 2 — e(l + v) 2 A*, , tt= 1,2. This implies 


u*Pu d 2 x > 


l + y 

with C = e 2 ||Ai 2 + A 2 211 
f u*(V + ^B)ud 2 x > 


J u*P 0 u d 2 x — C* || u|| 2 , 

Loo . Consequently, 

u*(V 0 +^B)ud 2 x-Cu\\u\\ 


U £ Cn 


l + y 


u£C 0 oo (R 2 ), 
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with H/ = v 1 (1 + v)^. The latter formula a nd ([7.14 ) imply cr e ss(’P + v l / £’) 


[—Cv 7 +oo). As v > 0 is arbitrary we arrive at (7.12). 


C 

□ 


Comprehensive analysis of the discrete spectrum of the operator A~^ BA^^ is 
a non-trivial task which lies outside the scope of this paper. We shall, however, 
briefly deal with the most basic situation. 

Suppose £ <S (R 2 ) and satisfies (□). and suppose A = 0. Let e 1 , |er| < m, be 
an eigenvalue of (|7.2| ). Then , using arguments similar to those in the second part 
of the proof of Theorem 7.1, one can sho w th at u g 6> (R 2 ). Therefore, in studying 
the discrete spectrum we can work with (3.6) or © rather than with ( |7.2| ). 

Suppose now $ is radially symmetric. Let us introduce polar coordinates x 1 = 
r cos'd, x 2 = rsind, and expand the ip± as 


ip± = E e i(fc T i )^«( r ). (7.15) 

feez 

Substituting these expansions into (0 we see that the latter separates into sys¬ 
tems of ordinary differential equations 


Hti 

G- k 


G k \ 



(7.16) 


where k runs through Z, and 


1 (1 d ^ d 

2m \r dr dr 



e$), 


Gi := 


1 

2 TO 


Id d 
r dr dr 


21 d 
r dr 



The orthogonal transformation 



-f 1 



turns ( 7.1(j ) into 


-a' + \g~ ~2 f ~ m2 f ± m ( £ ~ e ®)9 = ° ^ 

j' r* 7 *^ 


g" + V - -^g - */' + ^f ~ m 2 g ± m(e - e$)/ = 0, 


(7.17) 

(7.18) 


where the prime stands for the derivative in r. Resolving (7.17) with respect to / 
and substituting the resulting expression into (|7.1g|), we reduce our problem to a 
















The Dirac equation without spinors 


19 


single second order equation 


3k 2 + m 2 r 2 , 


g 


( k 2 + m?r 2 )r 

k 2 — m 2 r 2 k 2 , 2 km(e — ed>) , 9 \ 

tT/TT—^T ~2 “ T ± ~7¥T—±-+ (e - e$) - m U = 0. 

(fc- + TO z r~ )r z r' 2 /c- 2 + m z r z mr / (7 19) 


Let us compare our model equation (7.19) with the Klein-Gordon equation 

(7.20) 


g" + ^g' + (--^2 +( £ - e$ ) 2 -m 2 ) g = 0 , 


n £ Z. In view of (6/7), ( 7.15 ) it is natural to compare the two equations taking 
\n\ = | fc=F 11 when we are looking for the bound states of the electron (e ss +m ), and 
|n| = |fe ± 1| when we are looking for the bound states of the positron (e « —m). 
Clearly, for k = 0 (7.19) and ( |7.20| ) coincide. For k ^ 0 these equations differ 
and we shall compare their spectra asymptotically, assuming that e<f> is of the 
form a 2 H/(ar), T £ <S(R 2 ), a —> +0; this is a particular case of the situation 
dj). Asymptotic analysis gives e mo dei = £kg + 0(a 4 ), which in turn implies 


£model — ^Dirac 0(& )• 

We hope to justify a version of in the case of more general $ and A by 

means of asymptotic perturbation techniques (viz. Chapter VIII of !)• Here the 
technical difficulty is that in contrast to the Dirac operator which can be viewed 
as an analytic perturbation of th e Pauli operator (see Chapter 6 of |t) for more 
details) the perturbation in (3.9) is not analytic. 


8. Conclusion 


The results of Sections 6 and 7 seem to indicate that the basic effects attributed 
to spinors can be explained (at least in Minkowski 3-space) using simple tensor 
models. This observation may be useful in relation to attempts at modelling the 
electron as a soliton-like solution of some nonlinear system of partial differential 
equations. The usual approach, see, e.g., |Q and [EGS|, involves the so-called 
Maxwell Dirac equation. In our view, it might make sense looking also at other 
nonlinear systems which do not necessarily have spinors occurring explicitly but 
may still produce spinor effects. With this goal in mind, let us compare our model 
equation © with the Maxwell system. 

Define the operator of exterior differentiation d mapping vectors to antisymmet¬ 
ric tensors as (du)^ v := zV— iV v u^, and its dual S as (6 T) V := zV /i T ,f “ / ; 
we also define the action of S on vectors as Su := iS/^u^. By analogy, de¬ 


fine ( dAU ) 


[l V 


:= P M u„ - P v u M , {SaT) v := P^ 


Now, squaring (0) 


gives 
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curl,* u = mfu . But ciuIa^ = Sa<1a, so our model equation (1.2) becomes 

SacIau = m 2 u. 


( 8 . 1 ) 


According to formula (30.2) of [ LL2 ] the Maxwell system can be written as 

SdA = —47 rj , 5A = 0, (8-2) 


where j is the (given) current. The similarity between (8T) and ( 8^2 ) is remarkable, 
and we hope to build further mathematical models on the basis of this similarity. 
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